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ABSTRACT 



The authors develop a model of undergraduate student 
attendance that relies on five parameters, one of these 
being a parameter of total work, w , required to 
complete the degree. An enrollment forecasting method 
consistant with these attendance patterns is developed 
and compared with data for the period, 1961-1966, and a 
cohort of 2126 and 3298 freshmen entering in the Fall 
semesters of 1955 and 1960, respectively. 

Under the assumptions of the model, the probability of 
graduation is shown to be the wth power of the condi- 
tional probability of successful completion of a unit 
of work given that a student drops out or attends and 
successfully completes a unit of work* 



I. INTRODUCTION 



(1) Formulation 

In several related papers (Oliver and Marshall (1969); Marshall, Oliver 
and SusloWy (1969)) the authors attempt to correlate data obtained from 
analysis of student attendance patterns with enrollment forecasts made over 
successive time periods. In both of these papers the authors rely heavily 
on linear models in which transition probabilities are used to estimate the 
movement of students between grades • Thus, it is possible to make use* of 
results which can be derived from the theory of Markov Chains (Bartholomew 
(1967); Gani (1963); Feller (1957)). 

In these models and the data that supports them there is a very appar- 
ent break point in the attendance patterns of students at the end of the 
8th semester (4th year) after initial registration* The effect can be seen 
in Figure (1) which shows the fraction of students having consecutive atten- 
dance patterns over a twelve semester period. ^ 

The purpose of the present paper is to propose a simple attendance and 
enrollment model based on ^'constant work" that is required of students before 
they can obtain a degree. We show that most of the relevant qualitative 
char.^cteristics of undergraduate student enrollments and attendance patterns 
at the Berkeley campus of the University of California can be analysed and 
predicted with reasonable accuracy by such a model. 

The probability model that we propose is simple in several respects: it 
assumes that, except for a quantitative measure of work completed towards a 
degree, a student m^akes a decision to attend, vacation or drop, out with proba-* 
bill ties p, q, and r respectively, such that p+q+r « 1. These probabili 
ties are independent of time or whether the student is freshn^an, sophomore. 
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junior or senior. If a student attends the university, the conditional 
probability that he will complete one unit out of a total of w units of 
work is s ; the conditional probability that he fails to comp? -ite a unit of 
work in the semester is 1-s . The following description may help clarify 
our use of the words, "attend'S "vacation" and "drop-out"- An attending 
student is enrolled in the university. The term, "drop-out" is used to denote 
a student that permanently leaves the system before graduation, independent of 
his academic standing. We distinguish our use of this word from the more 
common usage which has the connotation of poor academic performance. A student 
on vacation is not a "drop-out" nor is he enrolled; in the next semester he may 
elect to stay on vacation, attend or drop-out. 

(2) Notation 

Besides the four probabilities mentioned above and the constant work 

parameter w , we will denote the random number of consecutive attendances 

by N • and the random number of semesters that elapse between initial 
^ c 

registration and graduation or drop out by N . Furthermore, is the 

event: attend the n^^ semester after initial registration; is the event: 
vacation during the n^^ semester after initial registration, and is the 

event: graduate or drop out from the university on or before n^^ semester. 
Since these events are mutually exclusive and exhaustive. 



Pr{A } + Pr{B } + Pr{C } = 1. <1) 
n n n 



In our sample spaces graduation or drop out is guaranteed; thus. 



Lim rr{C } » 1. <2) 
n 

n->» 
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The relevant distributions of attendance, consecutive attendance, drop out 
and' graduation, as well as the linear equations for expected future enroll- 
ments, are derived in Sections II and III, Our paper concludes by comparing 
some theoretical results with statistical data obtained from an examination 
of a cohort of 3298 freshtnen enrolled for the first time in the Fall semester 
of 1960, and 2126 freshmen from 1955, at the Berkeley campus of the Univer- 
sity of California. 
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II. ATTENDANCE PATTERNS 

(1) Consecutive Attendances 

The second through fifth columns of Table 1 show the number and frac- 
tion of those students that entered ac freshmen in the Fall semesters of 
1955 and 1960 and completed each consecutive semester with no interruptions 
in attendance (Suslow, 1968). These fractions are plotted-in Figure (1). 
More will be said about this graph and other statistical data on student • 
attendance patterns in companion papers but, we now wish to point out and 
investigate the very apparent break point at the eighth semester. We" believe 
that this characteristic structure may be the result of students having a 
"constant work" requirement. Such a concept is inherent^in certain Markov 
models that we discuss in other papers but our interest here is to develop a 
simple independent trials model that clearly illustrates this basic point. 
It is also interesting to note that the 1955 and 1960 data indicate station- 
ary student attendance patterns. 



ERIC 



Number 
of 

Consecutive 
Semesters 
Completed 



0 
1 

2 
3 
4 
5 
6- 
7 
8 
9 
10 
11 



1955 Entering Freshmen 
(2,126 students) 



Number of 
Students 



2,126 
2,067 
1,923 
1,554 
1,373 
1,112 
1,027 
883 
819 
222 
112 
15 



Per cent 



100 

97 
90 
73 
65 
52 
48 
41 
38 
10 
5 
1 



1960 Entering Freshmer 
(3,298 students) 



Number of 
Students 



3,298 
3,228 
2,994 
2,301 
2,018 
1,679 
1,554 
1,371 
1,291 
363 
181 
33 



Per cent 



Table 1: 



100 

98 
91 
70 
61 
51 
47 
42 
39 
11 
5 
1 



Students Completing Each Consecutive Semester 
With No Interruptions in Attendance 
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We assume that as long as a student has not completed w units of 

work, the probability of attending a semester is p . Let N be the 

c 

number of consecutive attendances of a student after he first enters. For 
the first w semesters, any break in the attendance pattern can be caused 
only by a student vacationing or dropping out. Hence, 



Pr{N 1 n} = p" n = 0,1,2, , w. 



<3) 



For a student to attend at least w+k semesters without interruption, he 
must attead all w+k of them and successfully complete no more than (w-1) 
.in the first (w+k-1) semesters. Hence we see that the probability, 

w-1 



Pr{N^ > w+k} = p^^ X:C^j"^)s^<l-s) k = 1,2 



(4) 



j=0 



is the product of a geometric and incomplete binomial term. We can rewrite 
this expression 



Pr{N^ 1 n} = p 



= P 



n 



vn-l-j 



n - 0,1,2, . . . . w 

n = w+1, w+2, (5) 



Perhaps the most revealing way to show the effect of the w semesters of 
work is to ask for the conditional probability that n+1 consecutive 
semesters will be attended given that n have already been attended: 



Pr{N > n-f 1 I N > n} 
c — ' c — 



= P 



= pd-s'') 



_l-s^ (1 -h w(l-s)) 

= p ■ — 

1-s 



1 < n < w 



n = w 



n * w+l . 



<6) 
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A calculation of these conditional probabilities for the first ten terms 



when w » 8, p « .89, s ^ .953 yields the sequence, .89, .89, .89, ,89, 
.89, .89, .89, 0.32, 0.18 and indicates how rapidly the conditiomil proba- 
bility for the first nine terms of continuous attendance decreases after the 
eighth semester. In the first eight semesters, the fraction of continuous 
attendance students decreases geometrically but, beyond eight, the drop is 
much more rapid. The curve exhibited by the real data in the 1 to B semester 
range is due to the dependencies between states which are igno i w.iis 

simple model and is the subject of more detailed analysis in the two papers 
to which we have already referred. 



Semester 


1955 data 


I960. data 


* 


0 


1.00 


l.CD 


1.00 


1 


.97 


.98 


.89 


2 


.90 


.91 


.79 


3 


.73 


.70 


.70 


4 


.65 


.61 


.63 


5 


.52 


.51 


.56 


6 




,47 


.49 


7 


.4 


■ .42 


.44 


8 


.38 


. .39 


.39 


9 


.10 


.11 


.11 


10 


.05 


.05 


.02 


11 


.01 


.01 


.00 


*p = 0.890 


s = .953 







Table 2: The Comparison of Equation (5) with 
1955 and 1960 Cohort Data. 



(2) Attending and Vacationing Students 

Attendance in any one of the first w semesters simply requires 
that (a) the student attend and (b) he not drop out prior -:o that semester. 
Thus: 
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Pr{A^} = p 



Pr{A,} = (l-r)p 



Pr{A^} = 



(7) 



th, 



Attendance in the n (n^w) semester requires that no more than w-1 units 
of work be completed up to that point and, furthermore, that the student 
not drop out . Thus : 
w-1 

^''^^V = ("j^)^P^^^ (q+p(l-s)) n=w, wfl, wf2, ....,.(8) 



The first p on the right hand side ensures the students' presence on the n 

.n-1 



th 



sf..aester. Factoring out the term (1-r) 
Equation (8) yields the expression: 

.n-1 



from the right hand side of 



PrfA^} - P(l-r)' 



w-1 /n-l\ , 



n > 



1-r 



(9) 



which, of course, reduces to Equation (7) when n«w. The factor within 
square brackets is an incomplete binomial sum which is less than one; thus, 
the probability of attendance decreases more rapidly for n >^ w than it 
db^^^ for n <^ w. 

The probability of being on vacation on semester a is simply obtained by 
substituting q for p in Equations (7) and (9), 



Pr{B^} « q(l-0 



n-1 



1 < n < w 



(10) 



w < n . 
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(3) Fraction Graduating and Total Elapsed Time 

The fraction graduating from a freshman class is easy to obtain from 

the distribution of waiting time (in semesters) until graduation. Clearly, 
an entering freshman can not graduate in anything less than w semesters if 
he advances at the maximum rate of one unit of work per semester. Thus, if 
N is the random number of semesters to graduation, 

o 

Pr{Ng - n} = 0 n « 1,2,3, w-1 . (U) 

To graduate at the end of the n^^ semester (n>w) , the requirement is 
simply that in n-1 previous semesters, w-1 units of work are completed 
successfully; in the remaining n-w semesters, the student either does not 
attend or, if he attends, he fails to complete a unit of work; finally, he 
must successfully complete the last semester. Except for the last semester, 
the exact timing of failures or vacations is immaterial to the completioi; of 
work. Thus, the waiting time to graduation has the negative binomial distri- 
bution: 

Pr{Ng-n} = ps[(^3j)(ps)''"\q+p^(l.s))"'"] n-w, wfl. (12) 

By adding the terms in Equation '12) for all n»^w, we obtain the probability 
of ever graduating. To obtain this sum we make use of the identity: 

00 

E ('^t"^)x''(l-x)^ = 1 if Olx<l. (13) 
k=0 

and obtain the result, 

yPr{N «n} 
n«w 



k-O 



Thus, the probability of ever graduating is 



w 

Pr{ graduation} = (^J^) • (1*) 



The probability of graduating or dropping out on or before semester 

n is obtained by noting that for n<w-l no one graduates, and 

n . , 

Pr{C } = Frd-r)^'-^ = l-(l-r)" . l<n<w-l . (15) 

This probability is one minus the tail distribution that drop-out occurs 
after the n^^ semester. The probability distribution for nxw is only 
slightly more complicated* If N is the random variable measuring the 
total number of semesters that elapse between registration and graduation 
or drop-out, we make use of the cumulative distribution of N , 



2 Pr{N«j} « Pr{C^} (16) 
j-1 



and Equations (12) and (8) to obtain 

Pr{N-n} - ps|(;;:J)(ps)"-'(q+p(l-s) + ("j') (ps)^ (q+Pd's)) ""'"^ ] • 

By factoring (1-r)""''' terms and sunuaing the incomplete binomial expressions; 



Pr{C } - l-(l-r)" 
n 



n-1 



1- E (":M(x)^(i-s)"-^-J 

Thus, the requirement of Equation (1) is satisfied- 



w>n; x» 7^ . (17) 
- 1-r 
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(4) Correlation with Cohort Data 

The purpose of this section Is twofold: (1) to show how parameters 
estimated from certain statistics of the freshman cohort population can be 
used to make theoretical computations of the distributions that we have 
already derived In previous sections, and (11) to show how these theoretical 
computations agree with Independently obtained data on the attendance patterns 
of undergraduate students* The reader should bear In mind that we have not 
attempted to take Into account the more realistic and difficult Interdepen- 
dencles which arise as students proceed through grades; their attendance 
patterns are affected by performance to date, change In majors, curricula and 
so on; none of these important effects are explicitly calculated. 

The manner In which we estimate the parameters p, q, r and s Is 
stra^^ht forward. Since the distribution of consecutive attendances does 
not explicitly Involve the parameters r or q , we used the slope of this 
curve (Figure 1) before and after w«8 to estimate p and s . From 
Table 1 and Equation (5) we see that 

Pr{N >8} = p^ «= 0.390 . 

At n-9, we require that 

p^(l-s^) » 0.111 

From these equations we estimate that p » .890 and s » .953. 

Using these values of p and s In equation (14), we find the 
probability of graduating as a function of r , and tabulate these probabll-^ 
Itles In Table 3. Notice that the probability of graduation Is highly 
sensitive to the drop-out probability r even when ps Is close to 1. A 
change In r of 2 per cent from .06 to .08 leads to a 9 per cent drop In 
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graduation. 







r- 






\ r+ps 1 


0.00 


1.000 


0.02 


0.830 


0.04 


0.692 


0.06 


0.579 


0.08 


0.486 


0.10 


0.410 



Table 3: Probability o£ Graduation as a Function 

of the Drop-out Probability. (p=.890, s=.953) 

Berkeley campus records (Suslov, et al (1968)) indicate that 52 per cent of 
the entering freshmen in the 1955 and 1960 group eventually graduate. To 
estimate r , we therefore require that 

- 0.52 . 

\r+ps j 

Since (ps) ■ .848, this equation gives r = .072. Also, since p+q-fr = 1, 

we have q - .038, q-fp(l-s) = .080. Using these values, we calculate 

Pr{A } in Figure (2). The solid line is the theoretical curve predicted by 
n 

Equations (7) and (9); the 1955 and 1960 cohorts are represented by the data 
points. Notice that attendance of students in the early years is under- 
estimated while attendance in the fourth through seventh semesters Is over- 
estimated* However, the rapid decrease in the probability of attendance in 
the ninth, tenth and eleventh semesters seems to agree well with the data. 



.40 



.30 



.20 



0.10 • ^^^^ 

* 1960 Data 

— Theoretical 



.09 



•08 p « .890, r - .072 

07 8 = .958 w -8 

.06 
.05 

.04 
.03 



.02 



Figure 2 : The Probability of Attendance 
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III. ENROLTilENT FORECASTS 

(1) The Forecasting Model 

Denote the expected number of students at semester t that have 
completed j units of work by x^(t) . By this definition, enrollments 
include both vacationing and attending students. If a student attends and 
successfully completes a semester (with probability ps) , he moves to a 
group with one more unit of woik completed. If he attends and fails, or 
goes on vacation (with probability q+p(l-s) ), his level of work remains 
unchanged. Of course, if he drops out or graduates in a semester, he is not 
- counted in the next. Let (t) be the new admissions at level j into 

; semester t . Thus, the equations of motion for enrollments in semester t+1 

I in terms of enrollments in semester t are 

I Xj(t+1) - psxj^^(t) + (q-fp(l-s))Xj(t) + yj(t+l) (18) 

1 ' \ • ' J « 1,2> . . . , w-1 , 

I' ^ 

I t 0,1,2, 

|. with the special case, 

I XQ(t+l) - (q+p(l-s))xQ(t) + yQ(t+l), 

t 

I for those students who do not complete any units of work. Equation (18) can 

I be solved recursively for XQ(t+l) in terms of the sequences: 

I 

I (yj(i)), ((yj(2)), (yj(t+i)) c<j<w-i (i9) 

I and, given initial conditions, 

I 

I x^(0), X2(0), Vl^°^- (20) 
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With X(t) 3 (Xj(t)) and Y(t) « ^yj^^)) representing the row vectors of 
expected student enrollments and admissions, and P representing the 
transition probabilities, Equation (18) can be written in matrix form as 



X(t+1) = X(t)P + Y(t+1). (21) 

Again, X(0) = (Xj(0)) is given. It is well known (Feller, (1968); 
Bartholomew, (1967)) that as Y(t) approaches a limiting vector Y *#ith 
constant growth rate, the limiting enrollment vector can be written: 



X - Y(I-'P)'"''- . (22) 



The inverse of (I-P) exists because all rows except the last sum to 

l~r<l and elements in the las*t row corresponding to j=w-l sum to 

th - 1 

l-r-ps<l. The elements of the j column of (I-P)" can be explicitly 

obtained in powers of ^ ^^^ j 1 thus one can write the expected number of 

students with j units of work as: 

This steady-state formulation of the enrollment model is particularly 
interesting because we are now in a position to show that characteristics 
of students predicted by Equation (23) agree with theoretical results we 
have already derived in Section II for attendance patterns of entering 
students. Thus, an experimental and theoretical link between student 
attendance and enrollment patterns can be clearly established. 

To illustrate the consistency of these results > we consider the 
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special solution of Equation (23) when * 1 and Vj * 0» i^^' The 
expected number that graduate and drop out In the t^^ semester are given 
respectively by 

w-1 

psx„ . ity and r][]x (t). (24) 
Substituting Equation (23) into (24) yields the result: 

Pr{ graduation} = ps(r+ps)-^ y^(^) ' = (^) (25) 

which agrees with Equation (14)). 
Furthermore y 

w-1 

Pr{ drop-out} = r53 (r+ps)"^ yo(^^) ^ * ^^^^ 
By making use of the formula » 

w 

l+a+a^+.-.+a*'"-^ - . (27) 

I'-a 



we obtain 



tW 



« 1 • Pr{ graduation} . (28) 
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(2) Forecasts of Lower and Upper Division Students in 1961-1966 

As an independent experimental confirmation of the basic structure of 
our constant work model, we made forecasts of returning lower division (fresh- 
men plus sophomores and upper division students (Juniors plus seniors) in the 
five-year period, 1961-1966. Admission data for new students is given in 
Table (4). Recall that the forecasting problem associated with Equation (18) 
is one of predicting vectors X(t) for a given sequence of Y(t) vectors. 
No attempt has been made in this paper to develop or test models forecasting 
new admissions per se. 

The choice of X(0), initial enrollments, is difficult since data on 
new admissions, attending, vacationing or returning students has never been 
cdllected or classified in the units of work natural to our model.. Somewhat 
arbitrarily we ehose to assign the work indices 
j = 0,1 to freshmen 
= 2,3 to sophomores 

= 4,5 to juniors w=8 
= 6,7 to seniors 
and selected the fractions 



9- - 0.890 — - .785 



«0^1 "Z-" ""3 



(29) 



= .768 = .714 



to indicate the split of freshmen, sophomores, juniors and seniors into work 
levels for the year 1961. 

A good allocation of new admission data in Table (4) into the appro- 
priate work levels is also not obvious; we chose to allocate all fall admissions 
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into the even numbered work levels, and all spring admissions into the odd 
numbered levels based on our intuitive feeling that spring entrants would 
wait for a fall admission date if their completed work level were such that 
they could be in phase with the normal course curricula offered at that time. 

Our model assumes that new admissions and returning students are 
counted at one point in time whereas, in reality, approximately fifteen weeks 
elapse between the beginning and end of a semester; attrition of returning 
students, drop-outs and new admissions leave and enter the system at many 
non-overlapping periods of time. To simplify the structure of our accounting 
system, we assumed that all new admissions to -semester t attend that 
semester. Hence, vacationing students can only be drawn from the population 
of students that have been enrolled in at least one prior semester. We know 
that in real life, many new admissions leave before the semester is completed; 
it is difficult to include such, effects «t this time because none of our 
"University records have ever considered a separate category of vacationing 
students. 

From Equations (18) and (21) we see that the expected number on vacation 
at the t^^ period is 




(30) 



Subtracting this number from X(t) yields the expected number attending at 
the t^^ semester; 



X(l) + X Y(t). (31) 
qip qfp 

Using the new admissions of Table (4), we obtained initial values, 

er|c 
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X(0) » X(1961), by requiring that the number of attending students calculated 
from Equation (31) agree with the registrar records for students enrolled in 
the fall of 1961. These figures are shown in the first column of Table 5-. 
Forecasts of students at each level were obtained from Equation (21) and 
expected numbers attending were then calculated from (31). The results of 
these calculations are recorded in the top row of each division; actual 
historical figures are recorded beneath them. 
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